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comply with the Sequence Rules and that applicants must provide 
1) a substitute computer readable form (CRF) copy of the 
"Sequence Listing"; 2) a substitute paper copy of the "Sequence 
Listing " , as well as an amendment directing its entry into the 
specification; AND 3) a statement that the content of the paper 
and computer readable copies are the same and, where applicable, 
include no new matter, as required by 37 C.F.R. §1.821 (e) or 
§1. 821(f) or §1. 821(g) or §1. 825(b) or §1. 825(d). 

Please amend the specification as follows: 

In the Specification 

Please delete the previously submitted paper copy of the 
"Sequence Listing" inserted after page 71. 

Please insert after page 71 the amended paper copy of the 
"Sequence Listing", attached to hereto as Exhibit B. 

REMARKS 

In response to the May 24, 2000 Notice, applicants submit 
herewith a computer readable form (CRF) copy of a revised 
Sequence Listing which is set forth on the newly submitted paper 
copy of revised Sequence Listing. 

Applicants further submit a paper copy of the revised Sequence 
Listing, attached hereto as Exhibit B, and a Statement in 
Accordance with 37 C.F.R. §1. 821(f), attached hereto as Exhibit 
C, certifying that the computer readable form as required by 3 7 
C.F.R. §1.821 (e) is identical to the paper copy of Sequence 
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Listing . 

Applicants are submitting herewith a computer diskette containing 
a C.F.R. of the sequence listing. 

No fee is deemed necessary in connection with the filing of this 
Amendment. However, if any fee is required, authorization is 
hereby given to charge the amount of any such fee to Deposit 
Account No. 03-3125. 



Respectfully submitted, 
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^296^ SEQUENCE LISTING 



;i) GENERAL INFORMATION: 

"'(A) NAME*: Max- Planck-Gesellschaf t zur Foerderung der 
\ Wissenschaf ten e.V. 

(B) STREET: none 

(C) CITY: Berlin 

(E) COUNTRY: DE 

(F) POSTAL CODE (ZIP) : none 

(A) NAME : CNRS 

(B) STREET: rue de la cardonille 

(C) CITY: Montpellier Cedex 05 

(E) COUNTRY: FR 

(F) POSTAL CODE (ZIP): 34094 

Mil TITLE OF INVENTION: Nucleic acid, molecules coding for mammalian 
UX) tumor suppressor proteins and methods for their isolation 

(iii) NUMBER OF SEQUENCES: 17 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC- DOS /MS -DOS 

(D) SOFTWARE: Patentln Release #1-0, Version #1.30 (EPO) 

(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: "2790 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

( D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: cDNA to mRNA 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(3) LOCATION: 542. .2545 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 
GAATTCGGGA GAGCAAGCGG GCATCTCCTG GGCGCCGTCA TGGCTGCTTA GGCTGCGCTG 
CCTGCGGATC GCGGATCCGG GATCGGAGAT CTGACGGCGA CGCCTGAGTC CGGCTAGGGT 
AGGTCTGGGT TGGAGTCTGT GCCTGCTTCC TTGGCGTGTG GTTGTTCCTG CTTGATTGCT 
TCAGCGTGCC ATCGGCTTCG TATTTGCATA GGAGTCAGAG GAGTTAATCT TGTCTCCTCG 
AAGATAGACT CTCATGGTTT ATGATCCATC TCTGTGAGAA GACTTTATTT GTCTGTCTCT 

\ 
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TCTCACAGGT TTGAGTCTTC AGACTTCTAC AGAACTCCAT AATATCTGCC TCACAGCTGG 

CTTTCCTGCT CTCACAGAAG ATACCCAGCT ATTGTGCTCT GGATCTCTCC TGGCTGCTAG 

GCTGTAGCGC TGCCTTTCTG GAGTCAGGCT GTAGTGACTC CCCACCTTCT TTCTGTCTGG 

GCTTAAATGG CACAGCAGTT CCTCAGCACA TCTGAAGAAG AAAGTGTGAG AACCAAAGGC 

r ATC" GCT CCA TTC CGC TGT CAA AAA TGT GGC AAG TCC TTC GTC ACC 

C ATG GCT CUA i i<- ^ . L s ?he Val xhr 

Met Ala Pro Phe Arg Cys v»ln Lys cys i»-y ny 

1 5 10 

CTC GAG AAG TTC ACC ATT CAC AAT TAT TCC CAC TCC AGG GAG CGC CCA 
Tel Su Phe Thr He His Asn Tyr Ser His Ser Arg Glu Arg Pro 



20 



„ Tr - r aar rCT GAG TGT GGC AAA GCC TTC GTC TCC AAG TAT 

III £Js If, S £ SI SS Cys 01, L». «» v.! Ser Ly» tyr 

35 40 



AAG CTG ATG AGA CAC ATG GCC ACA CAC TCG CCA CAG AAG ATT CAC CAG 
L^s SS Set Lg His Met Ala Thr His Ser Pro Gin Lys He Hxs Gin 



50 55 



TCT ACT CAC TGT GAG AAG ACA TTC AAC CGG AAG GAC CAC CTG AAG AAC 
Cys ?Tr Ss Cys Glu Lys Thr Phe Asn Arg Lys Asp Hxs Leu Lys Asn 
65 70 75 

TAP CTC CAG ACC CAC GAT CCC AAC AAG ATC TCC TAC GCG TGT GAC GAT 
25 Zn £n T*hr His Asp Pro Asn Lys He Ser Tyr Ala Cys Asp AS g P 



80 



85 9° 



TGC GGC AAG AAG TAC CAC ACC ATG CTG GGC TAC AAG AGG CAC CTG GCC 
Ifs fly %s Tyr His Thr Met Leu Gly Tyr Lys Arg Hxs Leu Ala 

lit S5 =2 S S£ £ SS % 2S ^ Z £ iS SS 



115 



rTr GGG AGC ACC GAG GTC CTG CTG GAC CAC CTC AAG TCT CAC GCG 
SS SS Gly Ser tS Glu Val Leu Leu Asp His Leu Lys Ser Hxs Ala 
130 135 

TAA GAA AAG GCC AAC CAG GCA CCC AGG GAG AAG AAA TAC CAG TGC GAC 
Gxt S5 £Js S Asn Gin Ala Pro Arg Glu Lys Lys Tyr Gin Cys Asp 

150 13:3 



145 



CAC TGT GAT AGA TGC TTC TAC ACC CGG AAA GAT GTG CGT CGC CAC CTG 
£5 Cys Isl frg Cys Phe Tyr Thr Arg Lys Asp Val Arg Arg-Hxs Leu 
160 165 

CTC CTC CAC ACA GGA TGC AAG GAC TTC CTG TGT CAG TTC TGT GCC CAG 
G va S hTs fhr Gly Cys Lys Asp Phe Leu Cys Gin Phe Cys Ala Gin 
190 185 

Ar* TTT GGG CGC AAA GAC CAC CTC ACT CGT CAC ACC AAG AAG ACC CAC 
Arg HI aTy Arg Lys Asp His Leu Thr Arg His Thr Lys Lys Thr Hxs 

195 200 
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970 



1013 



1066 



1114 



1152 



TCC C*G GAG CTG ATG CAA GAG AAT ATG CAG GCA GGA GAT TAC CAG AGu 1210 
S = r c"n du Leu Met Gin Glu Asp. Met Gin Ala Gly Asp Tyr Gin Ser 
210 215 220 

, aT T TC CAA CTC ATT GCG CCT TCA ACT TCG TTC CAG ATA AAG GTT GAT 1253 
Asp Phe Gin Leu He Ala Pro Ser Thr Ser Phe Gin He Lys Val Asp 
* 225 230 235 

C~C ATG CCT CCT TTC CAG CTA GGA GCG GCT CCC GAG AAC GGG CTT GAT 130 6 

P-o Met Pro Pro Phe Gin Leu Gly Ala Ala Pro Glu Asn Gly Leu Asp 
2~40 245 250 255 

GGT GGC TTG CCA CCC GAG GTT CAT GGT CTA GTG CTT GCT GCC CCA GAA 1354 
Glv Gly Leu Pro Pro Glu Val His Gly Leu Val Leu Ala Ala Pro Glu 
Y y 260 265 270 

GAA GCT CCC CAA CCC ATG CCG CCC TTG GAG CCT TTG GAG CCT TTG GAG 1402 
Glu Ala Pro Gin Pro Met Pro Pro Leu Glu Pro Leu Glu Pro Leu Glu 
275 230 . 285. 

CCT TTG GAG CCT TTG GAG CCG ATG CAG TCT TTG GAG CCT TTG CAG CCT 14 50 

?-o Leu Glu Pro Leu Glu Pro Met Gin Ser Leu Glu Pro Leu Gin Pro 
290 295 300 

TTG GAG CCG ATG CAG CCT TTG GAG CCA ATG CAG CCT TTG GAG CCG ATG 14 93 

Leu Glu Pro Met Gin Pro Leu Glu Pro Met Gin Pro Leu Glu Pro Met 
305 310 315 

CAG CCT TTA GAG CCT TTG GAG CCT CTG GAG CCG ATG CAG CCT TTG GAG 154 6 

Gin Pro Leu Glu Pro Leu Glu Pro Leu Glu Pro Met Gin Pro Leu Glu 
320 325 330 335 

CCG ATG CAG CCT TTG GAG CCT ATG CAG CCA ATG CTG CCA ATG CAG CCA 1594 
oro Met Gin Pro Leu Glu Pro Met Gin Pro Met Leu Pro Met Gin Pro 
340 345 350 

ATG CAG CCA ATG CAG CCA ATG CAG CCA ATG CTG CCA ATG CAG CCA ATG 164 2 

Met Gin Pro Met Gin Pro Met Gin Pro Met Leu Pro Met Gin Pro Met 
355 360 365 

CTG CCA ATG CAG CCA ATG CAG CCA ATG CAG CCA ATG CTG CCA ATG CCA ' 1690 

Leu Pro Met Gin Pro Met Gin Pro Met Gin Pro Met Leu Pro Met Pro 
370 375 380 

GAG CCG TCT TTC ACT CTG CAC CCT GGC GTA GTT CCC ACC TCT CCT CCC 17 33 

Glu °ro Ser Phe Thr Leu His Pro Gly Val Val Pro Thr Ser Pro Pro 
385 390 395 

CCA ATT ATT CTT CAG GAG CAT AAG TAT AAT CCT GTT CCT ACC "-TCA TAT 178 6 

Pro He lie Leu Gin Glu Kis Lys Tyr Asn Pro Val Pro Thr Ser Tyr 
400 "05 410 415 

GCC CCA TTT GTA GGC ATG CCC GTC AAA GCA GAT GGC AAG GCC TTT TGC 18 34 

Ala P-o =he Val Gly Met Pro Val Lys Ala Asp Gly Lys Ala Pne Cys 
420 425 430 

.AAC GTG GGT TTC TTT GAG GAA TTT CCT CTG CAA GAG CCT CAG GCG CCT 18 3 2 

Asn Val Gly Phe Phe Glu Glu Phe Pro Leu Gin Glu Pro Gin Ala Pro 



435 



440 4 4 5 



TTC AAG TTC A AC CCA TGT TTT GAG ATG CCT ATG GAG C-GG TTT GGG AAA 
i£ ?le Asa Pro Cys Phe Gla Met Pro Met Glu Gly Phe Gly Lys 
450 455 40 ° 

PTC ACC CTG TCC AAA GAG CTG CTG GTA GAT GCT GTG AAT ATA GCC ATT 
S£ ?hr Leu III Lys Glu Leu Leu Val Asp Ala Val Asn lie Ala lie 



465 



470 475 



CCT GCC TCT CTG GAG ATT TCC TCC CTA TTG GGG TTT TGG CAG CTC CCC 
Pro 111 III III Glu He Ser Ser Leu Leu Gly Phe Trp Gin Leu Pro 
480 485 490 

CCT CCT ACT CCC CAG AAT GGC TTT GTG AAT AGC ACC ATC CCT GTG GGG 
Pro Pro tS Pro Gin Asn Gly Phe Val Asn Ser Thr lie Pro Val Gly 
500 505 3iu 

rrr CCC GAG CCA CTG CCC CAT AGG ATA ACC TGT CTG GCG CAG CAG CAG 
Pro Gly Gil fro Leu Pro His Arg He Thr Cys Leu Ala Gin, Gin Gin 
515 520 

rra CCG CCA CTG CCG CCG CCA CCA CCG CTG CCA CTG CCA CAG CCA CTG 
fro fro Pro Leu Pro Pro Pro Pro Pro Leu Pro Leu Pro Gin Pro Leu 
530 535 540 

rr , rTr CCA CAG CCA CTA CCA CAG CCA CAG ATG CAG CCA CAG TTT CAG 
Pro vll Pro 111 Pro Leu Pro Gin Pro Gin Met Gin Pro Gin Phe Gin 



545 



550 555 



TTG CAG ATC CAG CCC CAG ATG CAG CTA CCA CAG CTG CTG CCG CAA CTG 
III Gil 111 Gin Pro Gin Met Gin Leu Pro Gin Leu Leu Pro Gin Leu 
560 565 570 575 

CAA CCT CAG CAG CAG C&T GAT CCT GAG CCA GAG CCA GAG CCA GAG CCA 
SS Pro Gil Gin Gin Pro Asp Pro Glu Pro Glu Pro Glu Pro Glu Pro 
580 585 

GAG CCA GAG CCA GAG CCA GAG CCG GAA CCG GAA CCG GAG CCA GAG CCA 
Su Pro" Glu Pro Glu Pro Glu Pro Glu Pro Glu Pro Glu Pro Glu Pro 
595 600 605 

TAG CCA GAA CCA GAG CCA GAG GAA GAA CAG GAA GAG GCA GAA GAA GAG 
Gil Pro SS Pro Glu Pro Glu Glu Glu Gin Glu Glu Ala Glu Glu Glu 
610 615 520 

GCA GAG GAA GGA GCA GAG GAA GGA GCA GAA CCA GAG GCA CAG GCA GAA 
A i a" Gil SS Gly Ala Glu Glu Gly Ala Glu Pro Glu Ala Gin Ala Glu 
~ 625 630 635 

CAA GAG GAA GAG GAA GAG GAA GCG GAA GAG CCA CAG CCA GAA GAA GCC 
Gil Glu Glu Glu Glu Glu Ala Glu Glu Pro Gin Pro Glu Glu Ala 



640 



645 650 655 
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2506 



CAA ATA. GCA GGA CTC GTC TAT AAG AAA TGG ACA GTT TAG TTCCTCTTCT 2555 
Gin He Ala Glv Leu Val Tyr Lys Lys Trp Thr Val * 
660 665 

TGTTAGCTTA CTCTGTAGTT TCTTCTTCTT GTTGCCCATT GTGTAGCTTT ATAGAGTGTG- 2615 



ACGCTATTGA TGTCTCCATT TTTTAAAGTG AATTTAAATG TACTGTTCAA TATTTTTCAT 2 67 5 

GTGATGTTGT TCCAATGTGA GTTACGACTT CATTTATCTT AAAGACAaLaA CTGGTTGTCA 27 3 5 

GTCATATCTG ACAGAAGAAA GAAATCACTG TGTAACCAAG C C AT AT AG C G GCCGC 27 90 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 668 amino acids 

(B) TYPE : amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Ala Pro Phe Arg Cys Gin Lys Cys Gly Lys Ser Phe Val Thr Leu 

1 5 10 • : 15 

Glu Lys Phe Thr lie His Asn Tyr Ser His Ser Arg Glu Arg Pro Phe 
20 25 30 

Lys Cys Ser Lys Ala Glu Cys Gly Lys Ala Phe Val Ser Lys Tyr Lys 
35 40 45 

Leu Met Arg His Met Ala Thr His Ser Pro Gin Lys lie His Gin Cys 
50 55 60 

Thr His Cys Glu Lys Thr Phe Asn Arg Lys Asp His Leu Lys Asn His 
65 70 75 80 

Leu Gin Thr His Asp Pro Asn Lys He Ser Tyr Ala Cys Asp Asp Cys 
85 ' 90 95 

Gly Lys Lys Tyr His Thr Met Leu Gly Tyr Lys Arg His Leu Ala Leu 
100 105 HO 

His Ser Ala Ser Asn Gly Aso Leu Thr Cys Gly Val Cys Thr Leu Glu 
115 * 120 125 

Leu Gly Ser Thr Glu Val Leu Leu Asp His Leu Lys Ser His Ala Glu 
130 135 140 

Glu Lys Ala Asn Gin Ala Pro Arg Glu Lys Lys Tyr Gin Cys Asp His 
145 150 155 160 

Cys Asp Arg Cys Phe Tyr Thr Arg Lys Asp Val Arg Arg His Leu Val 
165 170 '175 

Val His Thr Gly Cys Lys Asp Phe Leu Cys Gin Phe Cys Ala Gin Arg 
180 185 190 

Phe Gly Arg Lys Aso His Leu Thr Arg His Thr Lys Lys Thr His Ser 
195 ~ 200 205 



Gin Glu Leu Met Gin Glu Asn Met Gin Ala Gly Asp Tyr Gin Ser Asn 



210 



215 



220 



oh- Gl" Leu lie Ala Pro Ser Thr 5er ?he Gin He Lys Val As? Pro 
225 " 230 235 ' 240 

M*t Pro Pro Phe Gin Leu Gly Ala Ala Pro Glu Asn Gly Leu Asp Gly 
245 250 255 

Glv Leu Pro Pro Glu Val His Gly Leu Val Leu Ala Ala Pro Glu Glu 
260 265 270 

Ala Pro Gin Pro Met Pro Pro Leu Glu Pro Leu Glu Pro Leu Glu Pro 
275 280 285 

Leu Glu Pro Leu Glu Pro Met Gin Ser Leu Glu Pro Leu Gin Pro Leu 
290 295 300 

Glu Pro Met Gin Pro Leu Glu Pro Met Gin Pro Leu Glu Pro Met Gin 
305 310 315 320 

P-o Leu Glu Pro Leu Glu Pro Leu Glu Pro Met Gin Pro Leu Glu Pro 
325 ' 330 335 

Met Gin Pro Leu Glu Pro Met Gin Pro Met Leu Pro Met Gin Pro Met 
340 345 350 

Gin Pro Met Gin Pro Met Gin Pro Met Leu Pro Met Gin Pro Met Leu 
355 360 365 

Pro M-t Gin Pro Met Gin Pro Met Gin Pro Met Leu Pro Met Pro Glu 
370 375 380 



Pro Ser Phe Thr Leu His Pro Gly Val Val Pro Thr Ser Pro Pro Pro 
385 350 395 400 



He He Leu Gin Glu His Lys Tyr Asn Pro Val Pro Thr Ser Tyr Ala 
405 410 415 

Pro Phe Val Gly Met Pro Val Lys Ala Asp Gly Lys Ala Phe Cys Asn 
420 425 430 

Val Gly Phe Phe Glu Glu Phe Pro Leu Gin Glu Pro Gin Ala Pro Leu 
435 440 445 

Lys Phe Asn Pro Cys Phe Glu Met Pro Met Glu Gly Phe Gly Lys Val 
450 455 460 

Thr Leu Ser Lvs Glu Leu Leu Val Asp Ala Val Asn He Ala He Pro 
465 ' 470 475 ^ 480 

Ala Ser Leu Glu He Ser Ser Leu Leu Gly Phe Trp Gin Leu Pro Pro 
485 490 495 



Pro Thr Pro Gin Asn Gly Phe Val Asn Ser Thr He Pro Val Gly Pro 
500 505 510 

Glv Glu Pro Leu Pro His Arg He Thr Cys Leu Ala Gin Gin Gin Pro 
515 520 525 



d_ 0 o ro »r 0 Pro Pro Pro Pro Leu Pro Leu Pro GIp. Pro Leu Pro 

530 535 540 

Val -o Gla Pro Leu Pro Gin Pro Gin Met Gin Pro Gin Phe Gin Leu 
= ,= 550 555 30U 

Gin lie Gin Pro Gin Met Gin Leu Pro Gin Leu Leu Pro Gin Leu Gin 
565 570 

p.o Gin Gin Gin Pro Asp Pro Glu Pro Glu Pro Glu Pro Glu Pro Glu 
580 585 590 

oro Glu Pro Glu Pro Glu Pro Glu Pro Glu Pro Glu Pro Glu Pro Glu 

595 600 605 



oro Glu Pro Glu Pro Glu Glu Glu Gin Glu Glu Ala Glu Glu Glu Ala 
* 610 615 620 

Glu Glu Gly Ala Glu Glu Gly Ala Glu Pro Glu Ala Gin Ala Glu Glu 

625 630 635 640 

Glu Glu Glu Glu Glu Glu Ala Glu Glu Pro Gin Pro Glu Glu Ala Gin 
645 650 6o5 

He Ala Gly Leu Val Tyr Lys Lys Trp Thr Val * 
660 665 

(2) INFORMATION FOR SEQ ID MO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide 1 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
ATGTCTCGAG GCCTTTGCGG CCGCTATANN NNNNNN 
(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(iii) HYPOTHETICAL: YES 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
AGGCCATGG 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(iii) HYPOTHETICAL: YES 



(ix) FEATURE: 

(A) NAME /KEY: - 

(B) LOCATION: 3 

(D) OTHER INFORMATION: /note- "N at position 3 is A or G . " 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
CCNCCATGG 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(iii) HYPOTHETICAL: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

His Ser Arg Glu Arg Pro Phe Lys Cys 
1 5 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 



(iii) HY POT HETI CAL : YES 




(ix) FEATURE: 

(A) NAME /KEY : Cross-links 
(3) LOCATION: 2 

(D) OTHER INFORMATION : /note- "X at position 2 is S or T." 

(ix) FEATURE: 

(A) NAME /KEY: Cross-links 

(B) LOCATION: 7 

(D) OTHER INFORMATION: /note- "X at position 7 is F or Y . " 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

His Xaa Gly Glu Lys Pro Xaa Xaa Cys 
1 5 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNES5: single 

(D) TOPOLOGY: linear 



HYPOTHETICAL: YES 

SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Ser Pro Gin Lys 
5 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(iii) HYPOTHETICAL: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Lys Lys Trp Thr 
1 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(iii) 

(xi) 

His 
1 



(ii) MOLECULE TYPE : other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

GTGATGGCGG CCGCCATTCC GCTGTCAAAA ATGTG 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: -11: 

CCGCGCCTCG AGGGTCTTCT TGGTGTGACG 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 25 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide " 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

GCGGCCGCAG AGCCGTCTTT CACTC 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 



CCGCGCCTCG AGAACTGTCC ATTTCTTATA GAC 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 18 base pairs 



33 



(3) TYPHI : nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: 3EQ ID NO: 14: 
ATAGCAGTGA GTGCTGTG 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
GTTTCTTTTC AGGGACTC 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2334 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: single 

( D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL- NO 



(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 803. .2192 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 16: 
CGGTTCTTTC AATTCAGAAT TTGTTTTAGG TTCTGTTATT GCATAGATTT GCATACCTGT 
TTTATGGTAT TTTAATACTG TTGGTTTTAA AAAAT AC CAT TTCCTCTGAG TGCTGTTCTG 
AATATATTAT GTAAGCAATT TTGTGTGTTC TTTTTTTTCC AC T T GC AT AA AGCAGGGGAA 
AAGTTGAGAG TTTTTCTTAA TCCAGTCCCA AG T AG G AC AA AG GAT AT GAG TGTTTAAAGA 
TCATCTATTA AAAT G C AT G A AAAAAC AC T A GAAAATCTCC TGTGCACATC GCCAGTCGTG 
TGTGTGCTCT AGAAGTGAAG TTCAGGGGGT AACATAATGG AGGAATGTTT TCCTAGCTTC 
ATTCCCTGAC GATGTACAAG GTCTCTTCTC ACAGGTTTGA ATCTTCAGAC AAACTTCTGG 
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GAGGACTGGG AGGACTCGGT CCCTGCCTCG CAGCAGA.TGT TCCCTGTCAC TCAGTAGCCA 

ATCCGGGGGA CCCAGGACAT GCCCCAGCTA TAGTGATGCA GATTACCTTT CTGGTCCTGA 

ATCGCACCTG TGCCTCGAGA CTTTCTCCCC TCAGCTTGAG ACTGCATGTA AACTGGGATG 

TGTGAAAGCA GGAAGCAAAG CTAGTGACAG CTGAGAGGTC CATGTCTGGG TAGAACCAGG 

CCCACGATGC TGCCTCTCCC GTGGTCTGGA GTTCAGCTGC AGGGACTCTG CTGATTGGCC 

CAGCACCATC GTTCTGTTTG TGCTTAAATG GCACAGCATT TGGTCAGCAC AT C T G AAAAG 

rAACCTGTGA GAAGCAAAGC CC ATG GCC ACG TTC CCC TGC CAG TTA TGT GGC 
GAAGGTGTGA GAAGCAAAOO ^ ^ ^ ^ ^ ^ ^ Qys Qly 

'"x 5 10 

AAC ACG TTC CTC ACC CTG GAG AAG TTC ACG ATT CAC AAT TAT TCC CAC 
Thr Phe Leu Thr Leu Glu Lys Phe Thr lie His Asn Tyr Ser His 
15 20 ' 

TCC AGG GAG CGG CCG TAC AAG TGT GTG CAG CCT GAC TGT GGC AAA GCC 
Ser Axg Glu Arg Pro Tyr Lys Cys Val Gin Pro A 3 p Cys Gly Lys Ala 

30 35 

TTT GTT TCC AGA TAT AAA TTG ATG AGG CAT ATG GCT ACC CAT TCT CCC 
HI Val Ser Arg Tyr Lys Leu Met Arg His Met Ala Thr His Ser Pro 
45 50 5 ° 

CAG AAA TCT CAC CAG TGT GCT CAC TGT GAG AAG ACG TTC AAC CGG AAA 
Gin Lys III His Gin Cys Ala His Cys Glu Lys Thr Phe Asn Arg Lys 
60 65 ™ 

CAC CAC CTG AAA AAC CAC CTC CAG ACC CAC GAC CCC AAC AAA ATG GCC 
to His Leu L^s Asn His Leu Gin Thr His Asp Pro Asn Lys Met Ala 
75 80 85 

TTT GGG TGT GAG GAG TGT GGG AAG AAG TAC AAC ACC ATG CTG GGC TAT 
III Gly Cys Glu Glu Cys Gly Lys Lys Tyr Asn Thr Met Leu Gly Tyr 



95 



AAG AGG CAC CTG GCC CTC CAT GCG GCC AGC AGT GGG GAC CTC ACC TGT 
Zs Arg H^s Leu Ala Leu His Ala Ala Ser Ser Gly Asp Leu Thr Cys 

110 115 

GGG GTC TGT GCC CTG GAG CTA GGG AGC ACC GAG GTG CTA CTG GAC CAC 
Gly val Cys Ala Leu Glu Leu Gly Ser Thr Glu Val Leu Leu Asp His 
125 130 I 35 

CTC AAA GCC CAT GCG GAA GAG AAG CCC CCT AGC GGA ACC AAG GAA AAG 
HI Lys Ala His Ala Glu Glu Lys Pro Pro Ser Gly Thr Lys Glu Lys 

145 150 



140 



*AG CAC CAG TGC GAC CAC TGT GAA AGA TGC TTC TAC ACC CGG AAG GAT 
aau ^ Tyr Thr Arg Lys Asp 

Lys His Gin Cys As? his Cys Glu A-g <~ys ^ y ^ 



430 

540 

500 

660 

720 

730 

832 

880 

928 

976 

1024 

1072 

1120 

1168 

1216 

1254 

1312 



155 160 



GTG CGA CGC CAC CTG GTG GTC CAC ACA GGA TGC AAG GAC TTC CTG TGC 13 60 

Van A~g Arg His Leu Val Val His Thr Gly Cys Lys Asp ?he Leu Cys 
175 130 ' 135 

CAG TTC TGT GCC CAG AGA TTT GGG CGC AAG GAT CAC CTC ACC CGG CAT 14 03 

Gin Phe Cvs Ala Gin Arg ?he Gly Arg Lys Asp His Leu Thr Arg His 
190 195 200 

ACC AAG AAG ACC CAC TCA CAG GAG CTG ATG AAA GAG AGC TTG CAG ACC 14 5 6 

Thr Lys Lys Thr His Ser Gin Giu Leu Met Lys Glu Ser Leu Gin Thr 
205 210 215 

GGA GAC CTT CTG AGC ACC TTC CAC ACC ATC TCG CCT TCA TTC CAA CTG 150 4 

Gly Asp Leu Leu Ser Thr Phe His Thr He Ser Pro Ser Phe Gin Leu 
220 225 230 

AAG GCT GCT GCC TTG CCT CCT TTC CCT TTA GGA GCT TCT GCC CAG AAC 1552 
Lys Ala Ala Ala Leu Pro Pro Phe Pro Leu Gly Ala Ser Ala. Gin Asn 
235 * 240 245 ' 250 

GGG CTT GCA AGT AGC TTG CCA GCT GAG GTC CAT AGC CTC ACC CTC AGT 1600 
Gly Leu Ala Ser Ser Leu Pro Ala Glu Val His Ser Leu Thr Leu Ser 
255 260 265 

CCC CCA GAA CAA GCC GCC CAG CCT ATG CAG CCG CTG CCA GAG TCC CTG 164 3 

Pr-o Pro Glu Gin Ala Ala Gin Pro Met Gin Pro Leu Pro Glu Ser Leu 
270 275 280 

GCC TCC CTC CAC CCC TCG GTA TCC CCT GGC TCT CCT CCG CCA CCC CTT 1696 
Ala Ser Leu His Pro Ser Val Ser Pro Gly Ser Pro Pro Pro Pro Leu 
285 290 295 

CCC AAT CAC AAG TAC AAC ACC ACT TCT ACC TCA TAC TCC CCA CTT GCA 17 4 4 

Pro Asn His Lys Tyr Asn Thr Thr Ser Thr Ser Tyr Ser Pro Leu Ala 
300 305 310 

AGC CTG CCC CTC AAA GCA GAT ACT AAA GGT TTT TGC AAT ATC AGT TTG 17 92 

Ser Leu Pro Leu Lys Ala Asp Thr Lys Gly Phe Cys Asn He Ser Leu 
315 320 325 330 

TTT GAG GAC TTG CCT CTG CAA GAG CCT CAG TCA CCT CAA AAG CTC AAC 18 4 0 

Phe Glu Asp Leu Pro Leu Gin Glu Pro Gin Ser Pro Gin Lys Leu Asn 
335 340 345 

CCA GGT TTT GAT CTG GCT AAG GGA AAT GCT GGT AAA GTA AAC CTG CCC 18 8 8 

Pro Gly Phe Asp Leu Ala Lys Gly Asn Ala Gly Lys Val Asn-,Leu Pro 
350 355 360 

AAG GAG CTG CCT GCA GAT GCT GTG AAC CTA ACA ATA CCT GCC TCT CTG 193 6 

Lys Glu Leu Pro Ala Asp Ala Val Asn Leu Thr He Pro Ala Ser Leu 
365 370 375 



GAC CTG TCC CCC CTG TTG GGC TTC TGG CAG CTG CCC CCT CCT GCT ACC 
Aso Leu Ser Pro Leu Leu Gly Phe Trp Gin Leu Pro Pro Pro Ala Thr 



193^ 



380 335 390 

CAA AAT ACC TTT GGG AAT AGC ACT CTT GCC CTG GGG CCT GGG GAA TCT 20 32 

Gin ^sn Thr ?he Gly Asn Ser Thr Leu Ala Leu Gly. Pro Gly Glu Ser 
395 400 405 410 

TTG CCC CAC AGG TTA AGC TGT CTG GGG CAG CAG CAG CAA GAA CCC CCA 20 3 0 

Leu Pro His Arg Leu Ser Cys Leu Gly Gin Gin Gin Gin Glu Pro Pro 
415 420 425 

CTT GCC ATG GGC ACT GTG AGC CTG GGC CAG CTC CCC CTG CCC CCC ATC 212 8 

Leu Ala Met Gly Thr Val Ser Leu Gly Gin Leu Pro Leu Pro Pro He 
430 435 440 

CCT CAT GTG TTC TCA GCT GGC ACT GGC TCT GCC ATC CTG CCT CAT TTC 217 6 

Pro His Val Phe Ser Ala Gly Thr Gly Ser Ala He Leu Pro His Phe 
445 450 455 

CAT CAT GCA TTC AGA T AATTGATTTT TAAAGTGTAT TTTTCGTATT * 2222 
His His Ala Phe Arg 
460 

CTGGAAGATG TTTTAAGAAG CATTTTAAAT GTCAGTtACA ATATGAGAAA GATTTGGAAA 22 8 2 

ACGAGACTGG GACTATGGCT TATTCAGTGA TGACTGGCTT GAG AT G AT AA GA 2 334 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 63 amino acids 

(B) TYPE: amino acid 
( D) TOPOLOGY : linear 

(iij MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Met Ala Thr Phe Pro Cys Gin Leu Cys Gly Lys Thr Phe Leu Thr Leu 
! 5 10 15 

Glu Lys Phe Thr He His Asn Tyr Ser His Ser Arg Glu Arg Pro Tyr 
20 25 30 

jln Pro Asp Cys Gly 
35 40 

Leu Met Arg His Met Ala Thr His Ser Pro Gin Lys Ser His-.Gln Cys 
en 55 60 



Lys Cys Val Gin Pro Asp Cys Gly Lys Ala Phe Val Ser Arg Tyr Lys 

40 45 



50 55 60 

Ala His Cys Glu Lys Thr Phe Asn Arg Lys Asp His Leu Lys Asn His 
65 70 75 80 



Leu Gin Thr His Aso Pro Asn Lys Met Ala Phe Gly Cys Glu Glu Cys 
85 90 95 



Tv- Asn Th- Met Leu Giy Tyr Lys Arg His Leu Ala Leu 
Giy Lys Lys iy. Asn m_ - 11Q 

100 1U= 

Hi. *W U. ,« ,« Thr «. =1, v.l ey. ^ «, 

115 

Thr Glu Val Leu Leu As? His Leu Lys Ala His Ala Glu 



135 



140 



Leu Giy Ser 
130 

c . riv Thr L vs Giu Lys Lys His Gin Cys Asp His 
Glu Lvs Pro Pro Ser Giy mr uy=> ^-u y * 

145 ' 150 155 



C vs Glu Arg Cys Phe Tyr Thr Ar g Lys Asp Val Ar g Arg His Leu Val 
165 170 

val His Thr Giy Cys Lys Asp Phe Leu Cys Gin Phe Cys Ala Gin Arg 

180 13:3 

acn Hls t» u Thr Arg His Thr Lys Lys Thr, His Ser 
Phe Giy Arg Lys Asp His —a in. g ^ 

195 20U . 

, T vtot- Tvs Glu S-r Leu Gin Thr Giy Asp Leu Leu Ser Thr 
Gin Glu Leu Met Lys blu 1— 

-•in 215 

. Tltt ?ro S e- Phe Gin Leu Lys Ala Ala Ala Leu Pro 

Phe His Thr He Ser Pro be. ^ 24Q 



230 



n Ton riv Ala 3— Ala Gin Asn Giy Leu Ala Ser Ser Leu 
Pro Phe Pro Leu Giy nia a-- 255 

245 250 

n n v»l His Se- Leu Thr Leu Ser Pro Pro Glu Gin Ala Ala 
Pro Ala Glu Val His ber -eu 



260 265 



Gin ?ro Met Gin Pro Leu Pro Glu Ser Leu Ala Ser Leu His Pro Ser 



275 



Val Ser Pro Giy Ser Pro Pro Pro Pro Leu Pro Asn His Lys Tyr Asn 



290 



295 



Thr Thr Ser Thr Ser Tyr Ser Pro Leu Ala Ser Leu Pro Leu Lys Ala 



305 



310 



Asa Thr Lys Giy Phe Cys Asn He Ser Leu Phe Glu Asp Leu Pro Leu 

325 330 

<~i„ r^r* Tpu Asn Pro Giy Phe Asp Leu Ala 
Gin Glu Pro Gin Ser Pro Gin lvs Lea Asn rro y ^ 

340 

Lys Giy Asn Ala Giy Lys Val Asn Leu Pro Lys Glu Leu Pro Ala Asp 

Ala Val Asn Leu Thr He Pro Ala Ser Leu Asp Leu Ser Pro Leu Leu 

370 j75 



GIv Phe Tro Gin Leu Pro Pro 

390 39d 



■j ro AT a Thr Gin Asa Thr Phe Giy Asn 

400 



385 



Ser Th 



r Leu Ala Leu Gly Pro Giy 
405 



Glu Ser Leu Pro His krg Leu Ser 



410 



415 



Cvs Leu Gly Gin Gin Gin Gin Glu Pre Pro Leu Ala Met Gly Thr Val 



420 



425 



Ser Leu Gly 
435 

Gly Thr Gly 
450 



Gly Gin Leu Pro Leu Pro Pro 



440 



He Pro His Val Phe Ser Ala 
445 



Ser Ala He Leu Pro His Phe His His Ala Phe Arg 



455 



460 
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